^law Sequence Listing Erro^ummary 



ERROR DETECTED SUGGESTED CORRECTION 



1 — — TT — f 

SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



8 



Wrapped Nucleics 



Wrapped Aminos 



The numberAext at the end of each line "wrapped" down to the next line. 
This may occur if your file was retrieved In a word processor after creating it. 
Please adjust your right margin to .3, as this wilt prevent Vrapping". 

The amino acid numberrtext at the end of each line "wrapped * down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "\vrapping". 



Incor/ect Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 

Misaligned Amino Acid The numbering under each 5th amino acid is misaligned. This may be caused by the" use of tabs 
Numbering between the numbering. It is recommended to delete any tabs and use spacingfoetween- the numbers. 



Non-ASCII 



Variable Length 



Patentln ver. 2.0 "bug* 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 



Sequence(s) 



contain n's or Xaa's which represented more than one residue. 



As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from (he 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS') 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
t400> sequence id number 
000 



10 



Use of n's'or Xaa's 
(NEW f&L£S)' 



Use of n's and/or JCaa's have been detected in the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



11 



12 



J 



Use of <2l3>Organism Sequence(s) 
(NEW RULES) 



are missing this mandatory field or its response. 



Use of <220>Feature 
(NEW RULES) 



f are missin 



Sequence(s) 1 * ' are missing the <220>Feature and associated headings. 
Use of <220> to <223> is MANDATORY if <213>ORGANtSlvf is "Artificial" or "Unknown" 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol 63", No. 104, pp. 29631-32) (Sec. 1 .823 of new Rules) 



13 



Patentln ver. 2.0 "bug" 



Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 
Ttte'TestJIting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 
Instead, please use "File Manager" or any other means to copy file to floppy disk. 



AKS- Biotechnology Systems Branch- 5/15/99 
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RAW SEQUENCE LISTING DATE: 03/28/2001 

PATENT APPLICATION: US/09/4 24 , 4 87A TIME: 15:24:58 

Input Set : N:\Crf3\Refhold\I424487A.raw DoeS Not Comply 

output set: N:\cRF3\o3282ooi\i424487A.raw Corrected Diskette Needed 



4t 



2 



1 <110> APPLICANT: CHOO, Yen 

2 KLUG, Aaron 

3 ISALAN, Mark 

4 <120> TITLE OF INVENTION: Nucleic Acid Binding Proteins 

5 <130> FILE REFERENCE: 71278/264975 
C--> 6 <140> CURRENT APPLICATION NUMBER: US/09/424 f 4 87A 

7 <141> CURRENT FILING DATE: 2000-02-29 

8 <150> PRIOR APPLICATION NUMBER: GB 9710809.6 

9 <151> PRIOR FILING DATE: 1997-05-23 

10 <150> PRIOR APPLICATION NUMBER: PCT/GB9 8/0 1512 

11 <151> PRIOR FILING DATE: 1998-05-26 

12 <160> NUMBER OF SEQ ID NOS : 17 

13 <170> SOFTWARE: Patentln Ver . 2.1 

15 <210> SEQ ID NO: 1 

16 <211> LENGTH: 264 

17 <212> TYPE: DNA 

18 <213> ORGANISM: Artificial Sequence 

19 <220> FEATURE: 

20 <221> NAME/KEY: CDS 

21 <222> LOCATION: (1)..(264) 

22 <223> OTHER INFORMATION: Description of Artificial Sequence: encoding 

23 nucleic acid binding proteins 

24 <400> SEQUENCE: 1 

25 gca gaa gag aag cct ttt cag tgt cga ate tgc atg cgt aac ttc age 48 

26 Ala Glu Glu Lys Pro Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser 

27 1 5 10 15 

28 gat cgt act act ctt acc cgc cac acg agg acc cac aca ggc gag aag 96 

29 Asp Arg Thr Thr Leu Thr Arg His Thr Arg Thr His Thr Gly Glu Lys 

30 20 25 30 

31 cct ttt cag tgt cga ate tgc atg cgt aac ttc age agg age gat aac 144 

32 Pro Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Arg Ser Asp Asn 

33 35 40 45 

34 ctt acg aga cac eta agg acc cac aca ggc gag aag cct ttt cag tgt 192 

35 Leu Thr Arg His Leu Arg Thr His Thr Gly Glu Lys Pro Phe Gin Cys 

36 50 55 60 

37 cga ate tgc atg cgt aac ttc agg caa get gat cat ctt caa gag cac 24 0 

38 Arg lie Cys Met Arg Asn Phe Arg Gin Ala Asp His Leu Gin Glu His 

39 65 70 75 80 

40 eta aag acc cac aca ggc gag aag 

41 Leu Lys Thr His Thr Gly Glu Lys 

42 85 
44 <210> SEQ ID NO: 2 
4 5 <211> LENGTH: 88 

46 <212> TYPE: PRT 

47 <213> ORGANISM: Artificial Sequence 
4 8 <220> FEATURE: 

49 <223> OTHER INFORMATION: Description of Artificial Sequence: encoding 




264 
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RAW SEQUENCE LISTING DATE: 03/28/2001 

PATENT APPLICATION; US/09/424 , 487A TIME: 15:24:58 

Input Set : N:\Crf3\Refhold\I424487A.raw 
Output Set: N:\CRF3\03282001\l424487A.raw 



50 nucleic acid binding proteins 

51 <400> SEQUENCE: 2 



52 


Ala 


Glu 


Glu 


Lys 


Pro 


Phe 


Gin 


Cys 


Arg 


He 


Cys 


Met 


Arg 


Asn 


Phe 


Ser 


53 


1 








5 










10 










15 




54 


Asp 


Arg 


Thr 


Thr 


Leu 


Thr 


Arg 


His 


Thr 


Arg 


Thr 


His 


Thr 


Gly 


Glu 


Lys 


55 








20 










25 










30 






56 


Pro 


Phe 


Gin 


Cys 


Arg 


He 


Cys 


Met 


Arg 


Asn 


Phe 


Ser 


Arg 


Ser 


Asp 


Asn 


57 






35 










40 










45 








58 


Leu 


Thr 


Arg 


His 


Leu 


Arg 


Thr 


His 


Thr 


Gly 


Glu 


Lys 


Pro 


Phe 


Gin 


Cys 


59 




50 










55 










60 










60 


Arg 


He 


Cys 


Met 


Arg 


Asn 


Phe 


Arg 


Gin 


Ala 


Asp 


His 


Leu 


Gin 


Glu 


His 


61 


65 










70 










75 










80 


62 


Leu 


Lys 


Thr 


His 


Thr 


Gly 


Glu 


Lys 


















63 










85 
























65 


<210> SEQ 


ID NO: 3 





























66 <211> LENGTH: 18 

67 <212> TYPE: PRT 

68 <213> ORGANISM: Artificial Sequence 

69 <220> FEATURE: 

70 <223> OTHER INFORMATION: Description of Artificial Sequence: Nucleic acid 

71 binding protein 

72 <221> NAME/KEY: BINDING 

73 <222> LOCATION: (1)..(18) 

74 <22 3> OTHER INFORMATION: where X is any amino acid 

75 <4 00> SEQUENCE: 3 
Xaa Cys Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa His Xaa Xaa 

15 10 15 

W--> 78 Xaa His 

80 <210> SEQ ID NO: 4 



tips 



81<211> LENGTH: 21 l ^Tj^ Atff" Jj^Z4^ 






82 <212> TYPE: PRT 

83 <213> ORGANISM [(Artificial Sequence 

84 <220> FEATURE: * ■ - ^ r 

85 <223> OTHER INFORMATION: Description of Artificial Sequence :fwhere X is any - „ 

86 ^amino acid]) a _ ^ 0 ~ * tA ~ 0 



87 <221> NAME/KEY 

88 <222> LOCATION 

89 <400> SEQUENCE 




S 1} - (21) jJgvJM CUV Jmju J ftrkfic+J 

a Xaa Cvs Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa / 



BINDING 



4 

W--^ 90 Xaa Cys Xaa Xaa Cys Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa 

91 1 5 10 15 <Tt9yp 

92 His Xaa Xaa Xaa His ^Aj^r<JL.* 

93 20 a U y 



95 <210> SEQ ID NO: 5 JUo ^ufcU^ (P\ 



96 <211> LENGTH: 26 

97 <212> TYPE: PRT 

98 <213> ORGANISM: Artificial Sequence 

99 <220> FEATURE: 

100 <223> OTHER INFORMATION: Description of Artificial Sequence: Consensus zinc 

101 finger structure 



i 



t 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/424 r 487A 



DATE: 03/28/2001 
TIME: 15:24:58 



W--> 



W--> 



102 
103 
104 
105 
106 
107 
108 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 
153 



<221> 
<222> 
<400> 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<221> 
<222> 
<400> 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



<221> 
<222> 
<400> 



Input Set : N:\Crf3\Refhold\I424487A.raw 
Output Set: N:\CRF3\03282001\I424487A.raw 



NAME/KEY 
LOCATION 
SEQUENCE 



BINDING 
(1) • • (26) 
5 

Pro Tyr Lys Cys Pro Glu Cys Gly Lys Ser Phe Ser Gin Lys Ser Asp 

15 10 15 

Leu Val Lys His Gin Arg Thr His Thr Gly 

20 25 

SEQ ID NO: 6 
LENGTH: 29 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Consensus 
zinc finger structure 



NAME/KEY 
LOCATION 
SEQUENCE 



BINDING 
(1) • . (29) 
6 

Pro Tyr Lys Cys Ser Glu Cys Gly Lys Ala Phe Ser Gin Lys Ser Asn 

15 10 15 

Leu Thr Arg His Gin Arg lie His Thr Gly Glu Lys Pro 

20 25 

SEQ ID NO: 7 
LENGTH : 9 
TYPE: PR: 






ORGANISM ^Artificial Sequence 
FEATURE : 

QTHF g_ INFORMATION ; n^ ftnr i pi-j o e of Artif i ciaj^Se quence -.^ Jfhere x 
denotes a given combination of the bases at 
interface between DNA subsites, and the four bases 

are equally rep resented at DNA position 3 

NAME/KEY: BINDTftG — " "~ 

LOCATION: (1) . . (9) 
SEQUENCE: 7 

Gly Asn Xaa Xaa Cys Gly Gly Cys Gly 

1 5 
SEQ ID NO: 8 



LENGTH : 9 
TYPE: PRT 




ORGANISM^ Artificial Sequen 
FEATURE : ^ 




<210> 
<211> 
<212> 
<213> 
<220> 

'l?'*** JW*™* TKrenpMflTTO^; Description of Artificial Sequence: jfrhere X 



denotes a known combination of the two bases at 
DNA positions 4X and 5X and there is equal 
probability of any of the four bases at DNA position 3 

<221>*T ?AME/KEY: BINDING ' ■ 

<222> LOCATION: (1)..(9) 
<400> SEQUENCE: 8 

Gly Asn Xaa Xaa Cys Gly Gly Cys Gly 
1 5 



has*- . 




* 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/424 , 487A 



DATE: 03/28/2001 
TIME: 15:24:58 



W--> 



155 

156 

157 

158 

159 

160 

161 

162 

163 

164 

165 

166 

167 

169 

170 

171 

172 

173 

174 

175 

176 

177 

178 

179 

180 

181 

182 

184 

185 

186 

187 

188 

189 

190 

191 

192 

193 

194 

195 

196 

197 

199 

200 

201 

202 

203 

204 

205 

206 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 



<221> 
<222> 
<400> 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<221> 
<222> 
<400> 



Input Set : N:\Crf3\Refhold\I424487A.raw 
Output Set: N:\CRF3\03282001\l424487A.raw 

SEQ ID NO: 9 
LENGTH : 9 
TYPE: PR 
ORGAN IS 
FEATURE 

'HFR TNFm?MATJ: nM. nn^r j r t i ^n pf Artif icial Se quence:/ where X 
'denotes a known combination of the two bases at 

itions 4X and 5X 
NAME/KEY 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<221> 
<222> 
<400> 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 

<221> 





LOCATION 
SEQUENCE 



BINDING 

(1) • • (9) 
9 

Gly Cys Xaa Xaa Cys Gly Gly Cys Gly 

1 5 
SEQ ID NO: 10 
LENGTH: 28 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Zinc finger 
binding protein 




NAME/KEY 
LOCATION 
SEQUENCE 



BINDING 
(1) . . (28) 
10 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Asp Arg Ser Ser Leu 

15 10 15 

Thr Arg His Thr Arg Thr His Thr Gly Glu Lys Pro 

20 25 

SEQ ID NO: 11 
LENGTH: 28 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Zinc finger 
binding protein 



NAME/KEY 
LOCATION 
SEQUENCE 



BINDING 
(1) • . (28) 
11 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Asp Arg Ser His Leu 

15 10 15 

Thr Arg His Thr Arg Thr His Thr Gly Glu Lys Pro 

20 25 

SEQ ID NO: 12 
LENGTH: 27 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: Zinc finger 
binding protein 
NAME/KEY: BINDING 
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RAW SEQUENCE LISTING DATE: 03/28/2001 

PATENT APPLICATION: US/09/424 , 487A TIME: 15:24:58 

Input Set : N:\Crf3\Refhold\I424487A.raw 
Output Set: N:\CRF3\03282001\I424487A.raw 

207 <222> LOCATION: (1)..(27) 

208 <400> SEQUENCE: 12 

209 Phe Gin Cys Arg lie Cys Met Arg Asn Phe Ser Asp Arg Ser Asn Leu 

210 15 10 15 

211 Thr Arg His Thr Arg Thr His Thr Gly Glu Lys 

212 20 25 

214 <210> SEQ ID NO: 13 

215 <211> LENGTH: 9 

216 <212> TYPE: PRT 

217 <213> ORGANISM: Artificial Sequence 

218 <220> FEATURE: 

219 <223> OTHER INFORMATION: Description of Artificial Sequence: Zinc finger 

220 binding protein 

221 <221> NAME/KEY: BINDING 

222 <222> LOCATION: (1)..(9) 

223 <400> SEQUENCE: 13 

224 Ala Gly Ala Gly Ala Gly Cys Thr Cys 

225 1 5 

227 <210> SEQ ID NO: 14 

228 <211> LENGTH: 8 

229 <212> TYPE: PRT 

230 <213> ORGANISM: Artificial Sequence 

231 <220> FEATURE: 

232 <223> OTHER INFORMATION: Description of Artificial Sequence: 8bp 
23 3 palindromic sequence which is bound and cleaved by 

234 the restriction endonuclease Notl 

23 5 <2 21> NAME/KEY: BINDING 

236 <222> LOCATION: (1)..(8) 

237 <400> SEQUENCE: 14 

238 Gly Cys Gly Gly Cys Cys Gly Cys 

239 1 5 

241 <210> SEQ ID NO: 15 

242 <211> LENGTH: 9 

243 <212> TYPE: PRT 

244 <213> ORGANISM: Artificial Sequence 

245 <220> FEATURE: 

246 <223> OTHER INFORMATION: Description of Artificial Sequence: zinc finger 

247 binding protein 

248 <221> NAME/KEY: BINDING 

249 <222> LOCATION : (1)..(9) 

250 <400> SEQUENCE: 15 

251 Gly Cys Gly Gly Cys Cys Gly Cys Gly 

252 1 5 

254 <210> SEQ ID NO: 16 

255 <211> LENGTH: 8 

256 <212> TYPE: PRT 

257 <213> ORGANISM: Artificial Sequence 

258 <220> FEATURE: 

259 <223> OTHER INFORMATION: Description of Artificial Sequence: Notl 
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VERIFICATION SUMMARY DATE: 03/28/2001 

PATENT APPLICATION: US/09/424 , 4 8 7A TIME: 15:24:59 

Input Set : N:\Crf3\Refhold\I4244 87A.raw 
Output Set: N:\CRF3\03282001\l424487A.raw 

L:6 M:270 C: Current Application Number differs, Wrong Format 



L: 


76 M:341 W: 


(46) 


"n" 


or " 


Xaa" 


used. 


for SEQ ID#: 


3 


L: 


78 M:341 W: 


(46) 


"n" 


or " 


Xaa" 


used, 


for SEQ ID#: 


3 


L: 


90 M:341 W: 


(46) 


"n" 


or " 


Xaa" 


used, 


for SEQ ID#: 


4 


L: 


92 M:341 W: 


(46) 


"n" 


or " 


Xaa" 


used, 


for SEQ ID# : 


4 


L: 


137 M:341 W: 


(46) 


"n 


" or 


"Xaa 


" used, 


for SEQ ID# 


:7 


L: 


152 M:341 W: 


(46) 


"n 


" or 


"Xaa 


" used, 


for SEQ ID# 


: 8 


L: 


166 M:341 W: 


(46) 


"n 


" or 


"Xaa 


" used, 


for SEQ ID# 


: 9 



